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1.  Introduction 


Breast  cancer  is  a  heterogeneous  disease  in  terms  of  presentation,  morphology,  molecular  profile 
and  response  to  therapy.  Gene  expression  profiling  has  identified  six  molecular  subtypes,  i.e.  luminal  A, 
luminal  B,  normal  breast-like,  HER2+,  basal-like  and  claudin-low,  that  are  associated  with  clinical  markers 
as  well  as  prognosis  and  survival  [1-4].  However,  it  is  well  established  that  the  intrinsic  molecular  profiles 
of  breast  tumors  are  not  sufficient  to  perfectly  predict  disease  outcome.  Increasing  evidence  indicates 
that  characteristics  of  the  breast  stroma  and,  perhaps  more  specifically,  interactions  between  the  tumor 
epithelium  and  stroma  influence  breast  cancer  progression  and  response  to  therapy.  Previous  work  in 
our  lab  has  demonstrated  that  gene  expression  signatures  in  human  stroma  can  predict  outcome  of  breast 
cancer  patients  independently  of  clinical  parameters  and  molecular  subtypes  [5].  To  expand  on  these 
results,  the  goal  of  this  project  is  to  identify,  define  and  formally  test  critical  pathways  mediating  tumor 
epithelial-stromal  communication  and  co-dependency  in  Triple-Negative  (TN)  breast  cancer  (defined  as 
tumors  lacking  expression  of  estrogen  receptor,  progesterone  receptor  and  human  epidermal  growth 
factor  receptor-2  (HER2)),  a  subtype  associated  with  poor  outcome.  We  hypothesize  that  by  defining  the 
tumor  stromal  pathways  associated  with  poor  outcome  in  TN  tumors,  we  will  uncover  mechanisms  for  co¬ 
evolution,  biomarkers  and  potential  therapeutic  targets.  Our  specific  aims  are  to  develop  coordinate 
stromal-epithelial  expression  signatures  for  a  cohort  of  50  TN  breast  cancers  for  which  outcome  and 
follow-up  are  available,  to  identify  stromal-epithelial  gene  interaction  networks,  and  to  identify  and 
integrate  stromal-epithelial  gene  expression  and  microRNA  (miR)  signatures  associated  with  TN  breast 
tumors.  It  is  well  recognized  that  heterogeneity  is  a  key  factor  underlying  the  variability  in  patient 
response  to  treatment,  especially  in  TN  cases.  There  is  a  need  for  a  fuller  understanding  of  the  molecularly 
distinct  TN  subgroups  linked  to  outcome  and  the  development  of  more  personalized  treatment  strategies 
for  members  of  this  subgroup.  This  project  will  provide  the  first  integrated  in-depth  analysis  of  the 
contribution  of  tumor  stromal  processes  to  disease  heterogeneity,  and  will  position  the  tumor 
microenvironment  for  therapeutic  intervention.  This  project  also  promises  the  "next  generation"  of 
signatures  based  on  miR  that  are  stable  in  clinical  materials  and  can  be  developed  for  non-invasive  tests 
suitable  for  stratification  of  patients  for  chemotherapy,  monitoring  disease  progression  and,  in  the  long 
term,  for  early  detection  and  screening  for  metastatic  disease. 
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2.  Keywords 


Breast  cancer,  Triple-Negative,  epithelium,  stroma,  gene  expression,  microRNA,  laser  capture 
microdissection,  heterogeneity,  molecular  profiles,  tumor  microenvironment 

3.  Overall  Project  Summary 

3.1  Current  Objectives 

This  research  project  has  3  tasks  covering  3  years  (refer  to  Statement  of  Work  in  Appendix  1): 

1.  Develop  coordinate  stromal-epithelial  mRNA  expression  signatures  for  Triple-Negative 
(TN)  tumors. 

2.  Identify  stromal-epithelial  gene  interaction  networks. 

3.  Identify  and  integrate  stromal-epithelial  microRNA  (miR)  signatures  associated  with  TN 
breast  tumors. 

The  objectives  for  this  project  in  its  first  year  (2013-2014)  were  as  follows: 

•  Conduct  laser  capture  microdissection  (LCM)  to  isolate  separate  epithelial  and  stromal 
compartments  from  banked  tumor  samples,  both  tumor-associated  and  adjacent  normal 
tissues. 

•  Extract  RNA  from  epithelial  and  stromal  LCM  isolates  and  subject  to  microarray-based 
gene  expression  profiling. 

•  Profile  the  miR  expression  in  tumor  and  normal  epithelium  and  stroma. 

3.2  Results,  Progress  and  Acconnplishnnents 

During  this  first  year  of  the  project,  we  successfully  completed  our  first  goal  to  isolate  epithelial 
and  stromal  tissue  from  banked  TN  tumor  samples  via  LCM  (see  Figure  1  in  Appendix  2  for  a  depiction  of 
LCM).  My  Mentor,  Dr.  Morag  Park,  established  the  Breast  Cancer  Functional  Genomics  Group  (BCFGG)  in 
1999.  This  group  has  banked  fresh-frozen  breast  cancer  tumor  (approx.  700)  and  normal  (approx.  500 
including  matched  samples  and  reduction  mammoplasties)  tissue  samples  obtained  from  surgeries 
conducted  at  the  McGill  University  Health  Centre  (MUHC)  under  strict  quality  control  guidelines.  Blood 
samples  collected  at  the  time  of  surgery  have  been  processed  as  serum  and  plasma  and  stored.  Matched 
formalin-fixed  paraffin-embedded  (FFPE)  samples  from  the  clinical  pathology  archive  can  be  obtained 
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when  feasible  and  tissue  microarrays  for  banked  samples  have  been  constructed  to  aid  large-scale 
immunohistochemistry  and  in  situ  hybridization  analyses.  An  attending  clinical  pathologist  specializing  in 
breast  pathology  rescores  all  banked  samples  for  consistency.  HER2  fluorescence  in  situ  hybridization 
(FISH)  is  performed  to  confirm  HER2  status  in  equivocal  cases,  and  p53  mutation  analysis  is  conducted  for 
all  samples.  All  experimental  data  is  linked  to  information  regarding  pathology  analysis,  therapy  and 
disease  course.  Tissue  and  blood  collection  and  participant  follow-up  providing  outcome  is  conducted 
with  Research  Ethics  Board  approval.  Using  this  valuable  resource,  tumor  epithelial  and  stromal  tissues 
were  isolated  via  LCM  from  50  TN  patient  samples.  In  addition,  adjacent  normal  epithelial  and  normal 
stromal  tissues  were  isolated.  LCM  was  performed  as  previously  described  by  our  group  [6].  Briefly, 
human  breast  tumor  tissue  collected  from  consenting  patients  at  primary  surgery  was  snap-frozen  in 
Tissue-Tek  O.C.T.  (Sakura)  and  stored  in  liquid  nitrogen.  Blocks  were  sectioned  on  a  cryostat  as  5  pm 
sections  and  reviewed  by  an  experienced  attending  pathologist  specializing  in  breast  cancer  to  identify 
representative  regions  of  tumor  (TE),  tumor-associated  stroma  (TS),  histologically  normal  epithelium  (NE) 
and  histologically  normal  stroma  (NS)  (the  latter  two  distal  from  the  tumor).  Sections  (10  pm)  were  cut 
and  stained  using  the  Arcturus  Histogene  LCM  Frozen  Section  Staining  Kit  (Life  Technologies)  and 
representative  areas  were  isolated  by  LCM  using  an  Arcturus  PixCell  Me  instrument  (Life  Technologies).  All 
collection  was  performed  within  30  minutes  of  placing  the  slide  on  the  LCM  stage. 

We  also  successfully  completed  our  second  goal  for  the  first  year  of  this  project,  i.e.  to  extract 
RNA  from  epithelial  and  stromal  tissues  for  gene  expression  profiling.  RNA  was  extracted  from  epithelial 
and  stromal  LCM  isolates  and  subjected  to  microarray-based  gene  expression  profiling  via  Agilent 
SurePrint  G3  8x60K  chips  using  methods  based  on  those  previously  described  by  our  group  [5,  6].  Briefly, 
material  from  LCM  caps  for  each  sample  compartment  was  pooled  and  RNA  isolated  using  the  Arcturus 
PicoPure  RNA  Isolation  Kit  (Life  Technologies)  according  to  the  manufacturer's  directions.  Following 
quantification  of  yield  (Nanodrop  spectrophotometer)  and  quality  control  analysis  (Agilent  Bioanalyzer), 
samples  judged  as  of  sufficient  quantity  and  quality  were  subjected  to  2  rounds  of  amplification  using  the 
Arcturus  RiboAmp  HS  Plus  kit  (Life  Technologies)  according  to  the  manufacturer's  directions.  Resulting 
amplified  RNA  was  subjected  to  quality  control  assay  (Agilent  Bioanalyzer),  labelled  with  Cy3,  and 
hybridized  to  Agilent  SurePrint  G3  8x60K  Human  Gene  Expression  arrays  together  with  a  Cy5-labelled 
common  reference.  Hybridization  and  washing  were  carried  out  according  to  the  manufacturer's 
directions.  Subsequently,  arrays  were  scanned  on  an  Agilent  Microarray  Scanner  and  feature-extracted 
using  Agilent  FE  software.  A  preliminary  heatmap  of  the  gene  expression  data  demonstrates  that  the 
samples  typically  cluster  according  to  tissue  type  (Figure  2  in  Appendix  2). 


The  third  goal  for  the  first  year  of  this  project  was  to  profile  the  miR  expression  in  tumor  and 
normal  epithelium  and  stroma.  We  initially  proposed  to  profile  the  miR  expression  using  the  NanoString 
platform  available  at  the  Innovation  Centre  (McGill  University).  As  a  result  of  technical  difficulties,  we 
chose  an  alternate  platform,  i.e.  TaqMan  LDA  plate  assays  at  the  Institute  for  Research  in  Immunology 
and  Cancer  (IRIC)  at  Universite  de  Montreal.  Analysis  is  being  performed  by  the  McGill-Genome-Quebec 
Innovation  Centre.  The  miR  is  prepared  from  LCM  isolates  as  follows.  Material  from  LCM  caps  for  each 
sample  compartment  is  pooled  and  resuspended  in  300  pL  RLT  buffer  before  being  loaded  on  a 
Qiashredder  column  (Qiagen)  and  centrifuged  at  14  000  rpm  for  2  minutes.  Flowthrough  is  loaded  on  a 
Qiagen  AllPrep  DNA  spin  column  and  centrifuged  at  10  000  rpm  for  30  seconds.  Flowthrough  is  combined 
with  30  pL  2  M  sodium  acetate  pH  4.0,  330  pL  water-saturated  phenol  and  90  pL  chloroform-isoamyl 
alcohol  (23:1).  After  vortexing,  the  mixture  is  incubated  on  ice  for  15  minutes  and  centrifuged  at  12  000 
rpm  for  15  minutes.  The  upper  phase  (200  pL)  is  transferred  to  a  new  tube  and  1.5  pL  GlycoBlue  (Ambion; 
resuspended  at  100  pg/mL  in  isopropanol)  and  200  pL  isopropanol  are  added.  After  mixing  by  inversion 
(lOx),  the  mixture  is  incubated  at  -80°C  overnight,  then  centrifuged  for  30  minutes  at  4°C  (12  000  rpm)  to 
pellet  RNA.  Pellets  are  washed  twice  with  400  pL  ice-cold  75%  ethanol  and  air-dried  for  15  minutes.  Air- 
dried  pellets  are  resuspended  in  10  pL  ddH20  then  thoroughly  combined  with  250  pL  RLT  buffer.  Ethanol 
(390  pL  of  100%,  equating  to  1.5  volumes)  is  added  and  mixed  by  pipetting.  The  entire  mixture  is  loaded 
onto  a  RNeasy  MinElute  Spin  Column  (Qiagen)  in  a  2  mL  collection  tube  and  centrifuged  for  15  seconds  at 
10  000  rpm.  The  column  is  washed  twice  with  500  pL  Buffer  RPE  (Qiagen)  and  dried  by  centrifugation  at 
14  000  rpm  for  5  minutes.  RNA  is  eluted  from  the  column  with  20  pL  RNAse-free  ddH20.  The  RNA  is  re¬ 
applied  to  the  column  and  centrifuged  again  at  14  000  rpm  for  1  minute  to  elute  any  remaining  RNA. 
Extracted  RNA  is  quantified  using  a  spectrophotometer  (Nanodrop)  and  subjected  to  BioAnalyzer  to  assay 
for  quality  (Agilent  Technologies).  Total  RNA  (150-200  ng)  is  subjected  to  a  pre-amplification  step  using 
the  TaqMan  MegaPlex  PreAmp  primer  pool  and  the  pre-amplified  products  are  assayed  using  TaqMan 
LDA  384-well  plates  (Pools  A  and  B)  on  an  ABI  7900HT  Fast  Real-Time  system.  Currently,  the  miR  analysis 
is  ongoing  and  should  be  completed  for  all  50  TN  samples  by  the  end  of  November  2014.  This  slight  delay 
is  not  expected  to  affect  our  ability  to  investigate  miR  signatures  for  their  prognostic  value  by  using  linked 
patient  outcome  data  and  validate  miR  of  interest  as  scheduled  in  the  Statement  of  Work  (Appendix  1). 

3.3  Discussion 

To  increase  our  understanding  of  tumor  heterogeneity  within  the  TN  subtype  and  the  association 
of  stromal  subtypes  with  outcome,  we  proposed  to  profile  the  gene  expression  of  tumor  epithelium  and 
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associated  stroma  as  well  as  matched  normal  epithelium  and  stroma  and  apply  a  ''class  discovery" 
bioinformatics  approach  to  identify  stromal  subgroups.  The  first  two  objectives  for  this  project  in  its  first 
year  were  to:  (i)  conduct  LCM  to  isolate  separate  epithelial  and  stromal  compartments  from  banked  tumor 
samples,  and  (ii)  extract  RNA  from  epithelial  and  stromal  LCM  isolates  for  microarray-based  gene 
expression  profiling.  These  two  objectives  have  been  successfully  completed  and  are  the  first  steps  in  the 
larger  task  of  developing  coordinate  stromal-epithelial  mRNA  expression  signatures  for  TN  tumors. 
Preliminary  analysis  of  the  gene  expression  data  indicates  that  the  samples  cluster  according  to  tissue 
type;  moreover,  that  normal  and  tumor  epithelia  cluster  together  as  do  normal  and  tumor  stroma  (Figure 
2  in  Appendix  2).  With  these  results,  we  are  now  prepared  for  the  next  and  final  step  of  our  first  task  as 
detailed  in  the  Statement  of  Work  (Appendix  1),  i.e.  to  identify  stromal  subclasses  in  order  to  characterize 
normal  and  tumor  stroma  and  corresponding  epithelia  and  reveal  relevant  tumor-associated  changes  and 
epithelial-stromal  gene  expression  networks. 

Changes  in  miR  expression  have  been  documented  in  breast  cancer  [7-10],  and  several  of  these 
have  been  shown  to  be  associated  with  clinical  features  [7, 11-14]  including  response  to  therapy  [15-17]. 
However,  little  is  known  regarding  the  prognostic  value  of  miR  sets  in  tumor  stroma,  particularly  in  TN 
breast  cancer.  The  third  objective  for  this  project  in  its  first  year  was  to  profile  the  miR  expression  in 
tumor  and  normal  epithelium  and  stroma.  We  expect  this  objective  to  be  met  by  the  end  of  November 
2014.  This  will  allow  us  to  complete  our  next  objectives  of  linking  miR  signatures  to  patient  outcome  and 
validating  miR  of  interest  as  biomarkers  in  the  larger  task  of  identifying  and  integrating  stromal-epithelial 
miR  signatures  associated  with  TN  breast  cancer  as  scheduled  in  the  Statement  of  Work  (Appendix  1). 

4.  Key  Research  Accomplishments 

Nothing  to  report. 

5.  Conclusion 

Nothing  to  report. 

6.  Publications,  Abstracts  and  Presentations 

Nothing  to  report. 
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7.  Inventions,  Patents  and  Licenses 


Nothing  to  report. 

8.  Reportable  Outcomes 

Nothing  to  report. 

9.  Other  achievements 

9.1  Training  and  Professional  Developnnent 

As  the  Principal  Investigator  on  this  project,  I  have  had  the  opportunity  to  train  in  new  techniques 
and  improve  my  professional  skills  over  the  past  year.  My  collaborators,  Dr.  Nicholas  Bertos  and  Dr.  Hong 
Zhao,  are  key  members  of  the  BCFGG  with  experience  in  tissue  banking,  tissue  microdissection,  expression 
profiling  and  target  validation.  With  their  guidance,  I  have  learned  how  to  perform  LCM,  how  to  extract 
RNA  and  miR  from  LCM  isolates,  and  how  to  perform  microarray-based  gene  profiling  as  proposed  in  the 
Statement  of  Work  (Appendix  1).  In  addition,  I  have  met  routinely  with  Dr.  Bertos  and  my  mentor.  Dr. 
Park,  to  discuss  technical  and  theoretical  aspects  of  the  project  as  well  as  budgetary  concerns.  I  have 
learned  how  to  use  the  financial  systems  in  place  at  McGill  University  to  monitor  and  control  my  research 
funds.  These  meetings/training  have  contributed  to  my  training  in  project  management. 

My  project  location,  the  Goodman  Cancer  Research  Centre  at  McGill  University,  runs  a  weekly 
'work  in  progress'  seminar  series  at  which  graduate  students  and  postdoctoral  fellows  present  their  work. 
In  addition,  invited  external  speakers  present  their  current  research  at  regular  seminars.  Many  of  these 
researchers  are  working  on  breast  cancer  projects  and  these  seminars  are  keeping  me  abreast  of  current 
trends  in  the  field.  They  also  provide  opportunities  for  collaborations  or  additional  training. 

Trainees  at  the  Goodman  Cancer  Research  Centre,  through  the  McGill  Integrated  Cancer  Research 
Training  Program  (MICRTP)  as  well  as  the  Systems  Biology  Training  Program,  have  access  to  workshops 
such  as  development  of  hypothesis  and  grant  writing,  time  management,  effective  oral  and  visual 
communication,  advanced  statistical  analysis,  ethics,  knowledge  translation  and  bioinformatics.  This  year, 
I  took  a  course  in  the  bioinformatics  programming  language,  R.  This  will  assist  me  in  understanding  and 
manipulating  the  gene  expression  and  miR  profile  data,  and  help  me  to  communicate  more  effectively 
with  the  bioinformatics  personnel  on  this  project. 
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This  year,  I  also  trained  and  mentored  a  fourth-year  undergraduate  in  his  student  research 
project.  This  is  an  important  piece  of  my  training  because  as  a  professor  with  my  own  lab,  I  will  be  training 
and  directing/mentoring  undergraduate  and  graduate  students. 

Finally,  because  I  am  interested  in  connecting  basic  research  with  the  clinic,  I  attended  the 
Translational  Cancer  Research  for  Basic  Scientists  Workshop  offered  by  the  American  Association  for 
Cancer  Research  (AACR)  in  November  2013  in  Boston  MA.  This  workshop  covered  topics  such  as 
diagnostics,  clinical  trials,  regulatory  requirements,  personalized  medicine  and  translational 
collaborations.  In  addition  to  lectures  and  small  group  discussions,  this  workshop  offered  the  unique 
opportunity  to  observe  and  interact  with  health  professionals  in  various  clinical-related  settings.  All 
participants  visited  a  surgical  pathology  laboratory,  a  diagnostic  radiology  laboratory,  patient  clinics,  and 
an  Institutional  Review  Board  (IRB)  meeting  at  Massachusetts  General  Hospital  or  the  Dana-Farber  Cancer 
Institute.  These  on-site  sessions  included  shadowing  doctors  meeting  with  their  patients.  In  order  to 
participate  in  this  workshop,  trainees  had  to  pass  an  online  course  in  ethics  (working  with  human  subjects) 
offered  by  the  Dana-Farber  Cancer  Institute.  This  was  very  informative  and  relevant  to  my  project  as  I  am 
working  with  patient  samples.  Overall  the  experience  was  very  inspiring  and  expanded  my  vision  as  a 
breast  cancer  researcher. 
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11.  Appendices 


11.1  Statement  of  Work 


Statement  of  Work 

Note:  All  work  will  be  perfomied  at  the  Cioodnian  Cancer  Research  Institute,  1160  Des  Pins  Avenue  West, 
MontrcaL  Quebec,  Canada,  IB  A  I  A3  unless  .specified.  TTie  Principal  Investigator  (PI)  is  Dr.  Crista  Tlionipson 
and  the  Metitor  is  l>r.  Morag  Park. _ 


Task  Description  |  Year  1  |  Year!  |  Year 3 

1.  Develop  coordinate  stromal-epUheUal  mRNA  evpret^sion  signatures  for  Triple-negative  (TX)  tumors. 

*  Resource;  Dr.  Park  established  the  Rreast  Cancer  Functional  Genomics  Group.  Tliis  group  has  banked 
fresh-frozen  breast  cancer  tumor  (approx.  400)  and  normal  (approx.  500  including  matched  samples  and 
reduction  mmnmoplaslies)  tissue  samples  obtained  from  surgeries  conducted  at  the  McGill  University 
Health  Centre  under  strict  quality  control  guidelines.  Blood  samples  collected  at  the  time  of  surgery  have 
been  processed  as  serum  and  plasma  and  stored.  Matched  formalin-fixed  paraffm-em bedded  (FFPIQ 
samples  from  the  clinical  pathology  archive  can  be  obtained  wiien  feasible  and  tissue  microarrays  for 
banked  samples  have  boon  constructed  to  aid  large-scale  ITTC  and  in  situ  hybridization  analyses.  An 
attending  clinical  pathologist  specializing  in  breast  pathology  rescores  all  banked  samples  for 
consistency.  IIER2  Fluorescence  in  situ  h>'bridization  is  perlbrmcd  to  conflnn  IIER2  status  in  equivocal 
cases  and  p53  mutation  analysis  is  conducted  for  all  samples.  All  experimental  data  is  linked  to 
information  regarding  pathology  analysis,  therapy  and  disease  course.  I’issue  and  blood  collection  and 
participant  follow-up  providing  outcome  is  conducted  with  Research  Ethics  Board  approval. 

la.  Conduct  laser  capture  microdissection  (1  CM)  to  isolate  separate  epithelial 
and  stromal  conipaitments  from  banked  tumor  samples,  both  liimor-associated 
and  adjacent  nonnal  tissues. 

*  Collaborator/ Personnel;  Dr.  Nicbola.s  Bertos  /  Hong  Zhao 

*  Samples  from  30  'FN  patients  with  distant  recurrence  within  5  ycat^ 
and  20  'FN  patients  with  no  recurrence  in  5  years  will  be  anal^^-zed. 
llierefore,  there  will  be  a  total  of  200  analyses  (50  samples  >«  4  tissue 
c  om  partments/  s  ampl  c). 

*  PI  Training:  Lcam  how  to  pertbnn  LCM. 

Months 

1-8 

lb.  Exlraci  RNA  from  epithelial  ;ind  slromal  LCM  isolates  and  subject  to 
microarray- based  gene  expression  profiling. 

*  Collaboralor/Personnek  Dr.  Nicholas  Berios  HongZliao 

*  Profiling  will  be  performed  with  Agilent  Whole  Human  Cienome 
4x44 K  chips 

*  PI  draining:  l^am  how^  to  extract  RNA  from  LCM  isolates. 

*  PI  Training:  Tjcam  how  to  perform  microarray-based  gone  profiling. 

Months 

6-12 

Ic.  Ideiilify  slromal  subclasses. 

*  Collaboralor/Personnel:  Dr.  Michael  Hallett  Sadiq  Saleh 

*  Methods:  Cienes  defining  stromal  subclasses  will  demonstrate 
homogeneous  expression  witliin  the  corresponding  cluster,  as  well  *is 
helerogeneous  expression  outside  Ihe  cluster  as  delemiined  by  varimice 
component  analysis,  lire  biological  fimctions  over-represented  in  each 
stroma  class  will  be  identified  by  perfonning  gene  set  enriclimenl 
analysis  and  testing  for  enrichment  against  multiple  ontological 
databases  including  Gene  Gnlology  (fiO),  the  Kyoto  encyclopedia  of 
genes  and  genomes  (KEGG)  and  Lisl2List  (L2L). 

*  PI  draining;  I^eam  about  class  discovery  and  gene  set  enrichment 
analysis. 

Months 

1-6 

Mil&stone:  Complete  characterization  of  pro/iles  in  matched  normal  and  tumor  stroma  and  corresponding 
epithelia  to  reveal  relevant  tumor^associated  changes  and  epithelial-stromal  gene  expression  networks. 
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Fask  lle^crlptlon  | 

Year  1  | 

1  Year  2  | 

1  Year  3 

2.  Identify  stromal -epithelial  gene  interaction  networks. 

2a.  Develop  a  de  novo  bioiiiibnnatics  tool,  STR-EPI,  to  identify  genes 
modulating  cross-talk  between  tumor  epithelium  and  tumor- associated 
stromal  components, 

*  CoUahorator/Pcrsoiincl:  Dr.  Michael  Hallclt  /  Sadiq  Saleh 

*  Resources:  A  comprehensive  database  of  >  1600  breast  cancer  specific 
gene  signatures  (BreastSigDB).  Tliese  include  both  signatures  from  the 
literature  as  well  as  those  eontained  in  public  databases  such  as 
MsigBD. 

*  Methods:  We  will  develop  a  stromal-epithelial  interaction  map  for  each 
prominent  subtjpe  combination  identified  in  task  1  using  a  variety  of 
established  and  new  infomiatics  tools. 

Months 

6-12 

Months 

1-3 

Milestone:  Development  of  a  new  bioinformatics  tool  STR-EPI  to  identify  stromal-epithelial  gene  signatures. 

2b.  Characterize  epithelial -stromal  subtypes  specifically  associated  with  good 
or  pot>r  response  to  ehcniolhcrapy. 

*  Collaborator/Persoimel:  Dr.  Michael  Hallett  /  Sadiq  Saleh 

*  Resource:  We  have  generated  a  human  gene  expression  data 
compendium  derived  from  22  publicly  available  datasets  that  contained 
patients  diagnosed  with  invasive  duelal  carcinoma  with  associated 
clinical  infonnalion,  including  recurrence  status  (defined  as  distant 
metastasis  within  5  years),  survival,  and  immunohistochemistry  results 
(cunently  n  5 1 7  5  patients  containing  619  TN  p  atlents ). 

*  Methods:  Within  the  stromal  and  epithelial  datasets,  each  gene  present 
will  be  ranked  as  a  univariaic  predictor  of  reeurrenee  using  a 
parametric  test.  Ihese  predictors  will  be  trained  using  a  Naive  Bayes 
Classifier  and  crossvali dated  under  a  leave -one- out  cross-validation 
scheme.  Ihc  signature  will  be  re-trained  in  our  data  and  validated  using 
the  same  procedure  in  new  and  existing  gene  expression  datasets  with 
out  come  following  treatment  t<>  an  anthraeyelinc-  and- or  taxanc-based 
regimens  utilizing  our  breast  cancer  gene  expression  compendia 
mentioned  above. 

Months 

3-6 

2c.  Validate  STR-EPI  outcome  predictors. 

•  Col  1  ah  orator/  Perscjnn  el:  Dr.  N  i  eh  olas  1  iertos  /  i  1  on  g  Zhao 

•  Methods:  Outcome  predictors  will  be  validated  by  reverse  transcriptase 
PCR  and  IHC/j>?  situ  hybridization  using  available  matched  frozen 
and/or  archival  FFPE  tissue 

•  Methods:  Results  will  also  be  validated  with  a  tissue  mieroarray  (TM  A) 
composed  of  samples  from  "-500  patients  treated  at  the  MeCHll 
Univeisity  Health  Centre  with  5-year  follow-up  infoimation. 

Months 

7-12 

Milestone:  Identi  fication  and  validation  of  candidate  genes,  pathway. s  and  interaction  pairs  with  prognostic 
and/or  interventional  applicability. 
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Task  De^cnptioii  |  Year  1  |  Year  2  |  Year  3 

3.  Identify  and  integrate  stmnial-epitheliai  miRNA  (miK)  signatures  associated  with  TN  breast  tumiirs. 

3a.  Profile  tlie  niiR  expression  in  tumor  and  noniial  epithelium  and  stroma. 

•  Coll  ab  or  at  or/  Pcrsomie  1 :  Dr.  N  i  ch  0  las  Bcrtos  '  1 1  oiig  Zhao 

•  Methods;  miR  will  he  isolated  IVoin  our  LCM  samples  s pee i Tied  in 

Task  1.  The  concentration  will  be  assessed  and  quality  control 
perfomied  by  Nano  drop  speclrophotometer  and  Bioanalj'zer  analyses. 
Die  miR  expression  wdll  be  profiled  using  the  NaooString  platfomi 
available  at  the  hinovaiioii  Centre  (MeCiill  University). 

Reproducibility  will  be  assessed  by  quantile  nonnalization  orhiological 
replicates  and  the  mean  normalized  signal  from  biological  replicates 
will  be  used  for  comparative  expression  analysis. 

•  PI  'fraining:  Leani  bow^  to  extract  miR  from  IX^M  isolates. 

Moiitlis 

6-12 

Milestone:  Collection  of  miR  expression  profiles  in  tumor  and  normal  epithelium  and  stroma. 

3b.  bivestigate  miR  signatures  for  their  prognostic  value  by  using  linked 
patient  outcome  data. 

*  Collaborator/ Personnel:  Dr.  Miebael  Hallcll  /  Sadiq  Saleh 

*  Methods:  Dilferentially  expressed  miR  between  nomial  and  tumor 
tissues  (epithelium-  or  stroma-derived)  will  be  identified  using  one-way 
analysis  of  variance  (AN OVA,  p<0.5)  and  hierarchical  clustering  with 
Pearson  ct>rrelalion  using  the  lop  50  most  variably  expressed  miR. 
Ditfe rent] ally  expressed  miR  between  stromal  or  epithelial  samples  will 
be  identified  at  a  threshold  of  P  <  1  x  10-5,  using  the  LIMMA  package 
in  Bioconductor.  Die  miR  signatures  will  be  evaluated  for  their 
prognostic  value  using  linked  patient  outcome  data. 

*  PI  Training:  Learn  how  to  link  miR  signatures  to  patient  outcome. 

Months 

6-12 

3c.  Validate  miR  of  interest. 

•  Collaborator/Personnel;  Dr.  Nicholas  Bertos  /  Hong  Zhao 

•  Methods:  miR  of  interest  will  be  validated  via  situ  hybridization  on 
FFPE  sections  specified  in  Task  1. 

•  Methods:  PC R -based  assays  for  any  miR  that  correspond  with  tumor 
subtypes  we  previously  identified  will  be  established  such  that  the  miR 
can  be  used  as  biomarkers  in  TN  breast  cancer  patients. 

•  PI  Training:  Learn  how  to  quantify'  miR  using  PCR-based  tests  or  in 
situ  hybridization. 

Months 

1-12 

Milestone:  Identification  and  validation  of  miR  signatures  with  prognostic  value. 
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11.2  Figures 


Nature  Reviews  I  Genetics 

Figure  1:  Laser  capture  microdissection  (LCM)  is  a  technology  for  rapid  and  easy  procurement  of 
a  microscopic  and  pure  cellular  subpopulation  away  from  its  complex  tissue  milieu,  under  direct 
microscopic  visualization.  The  starting  material  can  be  frozen,  or  fixed,  and  stained.  A  thin  polymer  film  is 
placed  in  direct  contact  with  a  frozen  or  fixed  tissue  section  and  a  laser  beam  activates  the  polymer  and 
so  transfers  the  selected  cell(s)  out  of  the  tissue  and  onto  the  polymer  film.  This  positive  selection  method 
is  done  repeatedly  until  all  of  the  desired  tissue  is  embedded  onto  the  polymer  film.  An  extraction  buffer 
is  applied  to  the  polymer  film  so  that  DNA,  RNA  or  proteins  can  be  solubilized  from  the  captured  tissue 
cells.  LCM  fully  preserves  the  state  of  the  cell's  molecules  for  quantitative  analysis.  Adapted  from  [18]. 
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Color  Key 


Samples  cluster  according  to  tissue  type 


Figure  2:  Hierarchical  clustering  (unsupervised)  of  gene  expression  data  from  tumor  and  normal 
epithelium  and  stroma.  Shown  are  the  most  variable  genes  with  an  interquartile  range  (IQR)  greater  than 
2.  Samples  typically  cluster  according  to  tissue  type.  Tissue  type  =  tumor  epithelium  (red),  normal 
epithelium  (pink),  tumor  stroma  (blue),  normal  stroma  (light  blue). 
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